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ABSTRACT 

In this paper, we argue that type inferenc- 
ing incorrectly implements appropriate- 
ness specifications for typed feature struc- 
tures, promote a combination of type res- 
olution and unfiUing as a correct and ef- 
ficient alternative, and consider the ex- 
pressive limits of this alternative approach. 
Throughout, we use feature cooccurence 
restrictions as illustration and linguistic 
motivation. 



1 INTRODUCTION 

Unification formalisms may be either un- 
typed (dcgs, patr-ii, lfg) or typed 
(hpsg). a major reason for adding types 
to a formalism is to express restrictions 
on feature cooccurences as in GPSG [6] 
in order to rule out nonexistant types 
of objects. For example, there are no 
verbs which have the feature -|-N. The 
simplest way to express such restrictions 
is by means of an appropriateness par- 
tial function Approp:Type X Feat ^ Type. 
With such an appropriateness specifica- 
tion many such restrictions may be ex- 
pressed, though no restrictions involving 
reentrancies may be expressed. 

In this paper, we will first in §2 survey 
the range of type constraints that may be 
expressed with just a type hierarchy and 
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an appropriateness specification. Then in 
§3, we discuss how such type constraints 
may be maintained under unification as 
exemplified in the natural language pars- 
ing/generation system Troll [8].^ Unlike 
previous systems such as ALE, Troll does 
not employ any type inferencing. Instead, 
a limited amount of named disjunction 
([13], [5], [7]) is introduced to record type 
resolution possibilities. The amount of dis- 
junction is also kept small by the technique 
of unfiUing described in [10]. This strategy 
actually maintains appropriateness condi- 
tions in some cases in which a type in- 
ferencing strategy would fail. Finally, in 
§4, we discuss the possibilities for gener- 
alizing this approach to handle a broader 
range of constraints, including constraints 
involving reentrancies. 

2 APPROPRIATENESS 
FORMALISMS 

As discussed in Gerdemann & King [9], 
one can view appropriateness conditions as 
defining GPSG style feature cooccurence re- 
strictions (FCRs). In [9], we divided FCRs 
into conjunctive and disjunctive classes. A 
conjunctive FCR is a constraint of the fol- 
lowing form: 

if an object is of a certain kind 
then it deserves certain features 
with values of certain kinds 

An FCR stating that a verb must have V 
and N features with values -|- and — re- 
spectively is an example of a conjunctive 
FCR. A disjunctive FCR is of the form: 



^ The Troll System was implemented in Quintus 
Prolog by Dale Gerdemann and Thilo Gotz. 



if an object is of a certain kind 
then it deserves certain features 
with values of certain kinds, 
or it deserves certain (perhaps 
other) features with values of 
certain (perhaps other) kinds, 
or . . . 

or it deserves certain (perhaps 
other) features with values of 
certain (perhaps other) kinds 

For example, the following FCR stating 
that inverted verbs must be auxiliaries is 
disjunctive: a verb must have the features 
INV and AUX with values + and +, — and 
+ , or — and — respectively. 

Both of these forms of FCRs may be 
expressed in a formalism employing finite 
partial order (Type, C) of types under sub- 
sumption, a finite set Feat of features, 
and an appropriateness partial function 
Approp:Type X Feat ^ Type. Intuitively, 
the types formalize the notion of kinds of 
object, t \Z t' iS each object of type t' is 
also of type t, and Approp(i, /) = t' iff each 
object of type t deserves feature / with a 
value of type t' . We call such a formal- 
ism an appropriateness formalism. Car- 
penter's ALE and Gerdemann and Gotz's 
Troll are examples of implementations of 
appropriateness formalisms. 

How an appropriateness formalism en- 
codes a conjunctive FCR is obvious, but 
how it encodes a disjunctive FCR is less 
so. An example illustrates best how it is 
done. Suppose that FCR p states that ob- 
jects of type t deserve features / and g, 
both with boolean values and furthermore 
that the values of / and g must agree, p 
is the disjunctive FCR 

if an object is of type t 
then it deserves / with value -|- 
and g with value -|-, 
or it deserves / with value — 
and g with value — 

To encode p, first introduce subtypes, t' 
and t" of t (t \Z t' , t"), one subtype for 
each disjunct in the consequent of p. Then 
encode the feature/ value conditions in the 
first disjunct by putting Approp(i',/) = -|- 



and Approp(i',^) = -|-, and encode the fea- 
ture/value conditions in the second dis- 
junct by putting Approp(i",/) = — and 
Approp(f' ,£() = -.2 

This approach makes two important 
closed-world type assumptions about the 
types that subsume no other types (hence- 
forth species). First, the partition condi- 
tion states that for each type t, if an ob- 
ject is of type t then the object is of ex- 
actly one species subsumed by t. Second, 
the all-or-nothing condition states that for 
each species s and feature /, either every 
or no object of species s deserves feature 
/."^ An appropriateness formalism such as 
ALE ([2], [3]) that does not meet both con- 
ditions may not properly encode a disjunc- 
tive FCR. For example, consider disjunc- 
tive FCR p. An appropriateness formal- 
ism may not properly encode that t' and t" 
represent all and only the disjuncts in the 
consequent of p without the partition con- 
dition. An appropriateness formalism may 
not properly encode the feature/ value con- 
ditions demanded by each disjunct in the 
consequent of p without the all-or-nothing 
condition. 

As indicated above, ALE is an example 
of a formalism that does not meet both of 
these closed world assumptions. In ALE a 
feature structure is well-typed iff for each 
arc in the feature structure, if the source 
node is labelled with type t, the target 
node is labelled with type t' and the arc is 
labelled with feature / then Approp(i, /) C 
t' . Furthermore, a feature structure is 
well-typable iff the feature structure sub- 



^This example FCR is, for expository purposes, 
quite simple. The problem of expressing FCR's, 
however, is a real linguistic problem. As noted by 
Copestake et al. [4], it was impossible to express 
even the simplest forms of FCRs in their extended 
version of ALE. 

The basic principle of expressing FCRs also ex- 
tends to FCRs involving longer paths. For exam- 
ple, to ensure that for the type t, the path (fg) 
takes a value subsumed by s, one must first intro- 
duce the chain Approp(t, /) = u, Approp(M, g) = s. 
Such intermediate types could be introduced as 
part of a compilation stage. 

•^Note that these closed world assumptions are 
explicitly made in Pollard & Sag (1994) [14]. 



sumes a well-typed feature structure. In 
ALE, type inferencing is employed to en- 
sure that all feature structures are well- 
typable — in fact, all feature structures are 
well typed. Unfortunately, well-typability 
is not sufficient to ensure that disjunctive 
FCRs are satisfied. Consider, for exam- 
ple, our encoding of the disjunctive FCR p 
and suppose that Lp is the feature structure 
t[f : -|-,^ : — ]. (/J is well-typed, and hence 
trivially well-typable. Unfortunately, Lp vi- 
olates the encoded disjunctive FCR p. 

By contrast, the Troll system described 
in this paper has an effective algorithm for 
deciding well-formedness, which is based 
on the idea of efficiently representing dis- 
junctive possibilities within the feature 
structure. Call a well-typed feature struc- 
ture in which all nodes are labelled with 
species a resolved feature structure and 
call a set of resolved feature structures that 
have the same underlying graph (that is, 
they differ only in their node labellings) 
a disjunctive resolved feature structure. 
We write J^S, TITS and VTZTS for the 
collections of feature structures, resolved 
feature structures and disjunctive resolved 
feature structures respectively. Say that 
F' e UTS is a resolvant oi F e TS ifl" 
F and F' have the same underlying graph 
and F subsumes F' . Let type resolution be 
the total function TZ: TS VTZTS such 
that TliF) is the set of all resolvants of F. 

Guided by the partition and all-or- 
nothing conditions. King [12] has formu- 
lated a semantics of feature structures and 
developed a notion of a satisfiable feature 
structure such that F G TS is satisfiable 
\^TZ{F) ^ 0. Gerdemann k King [9] have 
also shown that a feature structure meets 
all encoded FCRs iff the feature structure 
is satisfiable. The Troll system, which is 
based on this idea, effectively implements 
type resolution. 

Why does type resolution succeed where 
type inferencing fails? Consider again the 
encoding of p and the feature structure 
Lp. Loosely speaking, the appropriate- 
ness specifications for type t encode the 
part of p that states that an object of 



type t deserves features / and g, both 
with boolean values. However, the ap- 
propriateness specifications for the speci- 
ate subtypes t' and t" of type t encode 
the part of p that states that these val- 
ues must agree. Well-typability only con- 
siders species if forced to. In the case 
of (/J, well-typability can be established 
by considering type t alone, without the 
partition condition forcing one to find a 
well-typed species subsumed by t. Conse- 
quently, well-typability overlooks the part 
of p exclusively encoded by the appropri- 
ateness specifications for t' and t" . Type 
resolution, on the other hand, always con- 
siders species. Thus, type resolving Lp 
cannot overlook the part of p exclusively 
encoded by the appropriateness specifica- 
tions for t' and t" . 

3 MAINTAINING 

APPROPRIATENESS 
CONDITIONS 

How may these VTZTS be used in an im- 
plementation? A very important prop- 
erty of the class of VTZTS is that they 
are closed under unification, i.e., if F and 
F' e VUTS then F U F' e VTZTS.'^ 
Given this property, it would in princi- 
ple be possible to use the disjunctive re- 
solved feature structures in an implemen- 
tation without any additional type infer- 
encing procedure to maintain satisfiabil- 
ity. It would, of course, not be very effi- 
cient to work with such large disjunctions 
of feature structures. These disjunctions 
of feature structures, however, have a sin- 
gular property: all of the disjuncts have 
the same shape. The disjuncts differ only 
in the types labeling the nodes. This prop- 
erty allows a disjunctive resolved feature 
structure to be represented more efficiently 
as a single untyped feature structure plus 

*In fact, it can be shown that if F and F' G 
TS then 7^(_F) U 7^(_F') = 11{F U F'). Unification 
of sets of feature structures is defined here in the 
standard way: 5 U 5' = {_F | _F' G 5 and F" G S' 
and F = F'UF"}. 



a set of dependent node labelings, which 
can be further compacted using named dis- 
junction as in Gerdemann [7], Dorre & 
Eisele [5] or Maxwell & Kaplan [13]. 

For example, suppose we type resolve 
the feature structure t[f : bool,g : bool] us- 
ing our encoding of p. One can easily see 
that this feature structure has only two re- 
solvants, which can be collapsed into one 
feature structure with named disjunction 
as shown below: 
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We now have a reasonably compact rep- 
resentation in which the FCR has been 
translated into a named disjunction. How- 
ever, one should note that this disjunc- 
tion is only present because the features 
/ and g happen to be present. These fea- 
tures would need to be present if we were 
enforcing Carpenter's [2] total well typing 
requirement, which says that features that 
are allowed must be present. But total well 
typing is, in fact, incompatible with type 
resolution, since there may well be an infi- 
nite set of totally well typed resolvants of a 
feature structure. For example, an under- 
specified list structure could be resolved to 
a list of length 0, a list of length 1, etc. 

Since total well typing is not required, 
we may as well actively unfiU redundant 
features.^ In this example, if the / and g 
features are removed, we are left with the 
simple disjunction which is equiv- 

alent to the ordinary type t.^ Thus, in this 
case, no disjunction at all is required to en- 
force the FCR. All that is required is the 
assumption that t will only be extended 
by unifying it with another (compacted) 
member of VTZTS. 

^Intuitively, features are redundant if their val- 
ues are entirely predictable from the appropriate- 
ness specification. See Gotz [10], Gerdemann [8] 
for a more precise formulation. 

^In this case, it would also have been possible 
to unfiU the original feature structure before re- 
solving. Unfortunately, however, this is not always 
the case, as can be seen in the following example: 
Af ■■ +] {*'[/ : +]} 



This, however, was a simple case in 
which all of the named disjunction could 
be removed. It would not have been pos- 
sible to remove the features / and g if 
these features had been involved in reen- 
trancies or if these features had had com- 
plex values. In general, however, our expe- 
rience has been that even with very com- 
plex type hierarchies and feature struc- 
tures for HPSG, very few named disjunc- 
tions are introduced.'' Thus, unification is 
generally no more expensive than unifica- 
tion with untyped feature structures. 

4 CONCLUSIONS 

We have shown in this paper that the kind 
of constraints expressible by appropriate- 
ness conditions can be implemented in a 
practical system employing typed feature 
structures and unification as the primary 
operation on feature structures. But what 
of more complex type constraints involv- 
ing reentrancies? Introducing reentrancies 
into constraints allows for the possibility of 
defining recursive types, such as the defi- 
nition of append in [1]. Clearly the re- 
solvants of such a recursive type could not 
be precompiled as required in Troll. 

One might, nevertheless, consider al- 
lowing reentrancy-constraints on non- 
recursively defined types. A problem still 
arises; namely, if the resolvants of a feature 
structure included some with a particu- 
lar reentrancy and some without, then the 
condition that all resolvants have the same 
shape would no longer hold. One would 
therefore need to employ a more com- 
plex version of named disjunction ([13], 
[5], [11]). It is questionable whether such 
additional complexity would be justified 
to handle this limited class of reentrancy- 
constraints. 

It seems then, that the class of con- 
straints that can be expressed by appro- 

'^Our experience is derived primarily from test- 
ing the Troll system on a rather large grammar 
for German partial verb phrases, which was writ- 
ten by Erhard Hinrichs and Tsuneko Nakazawa 
and implemented by Detmar Meurers. 



priateness conditions corresponds closely 
to the class of constraints that can be effi- 
ciently precompiled. We take this as a jus- 
tification for appropriateness formalisms 
in general. It makes sense to abstract out 
the efficiently processable constraints and 
then allow another mechanism, such as at- 
tachments of definite clauses, to express 
more complex constraints. 
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